526 research outputs found

    Genetic algorithm learning as a robust approach to RNA editing site prediction

    Get PDF
    BACKGROUND: RNA editing is one of several post-transcriptional modifications that may contribute to organismal complexity in the face of limited gene complement in a genome. One form, known as C → U editing, appears to exist in a wide range of organisms, but most instances of this form of RNA editing have been discovered serendipitously. With the large amount of genomic and transcriptomic data now available, a computational analysis could provide a more rapid means of identifying novel sites of C → U RNA editing. Previous efforts have had some success but also some limitations. We present a computational method for identifying C → U RNA editing sites in genomic sequences that is both robust and generalizable. We evaluate its potential use on the best data set available for these purposes: C → U editing sites in plant mitochondrial genomes. RESULTS: Our method is derived from a machine learning approach known as a genetic algorithm. REGAL (RNA Editing site prediction by Genetic Algorithm Learning) is 87% accurate when tested on three mitochondrial genomes, with an overall sensitivity of 82% and an overall specificity of 91%. REGAL's performance significantly improves on other ab initio approaches to predicting RNA editing sites in this data set. REGAL has a comparable sensitivity and higher specificity than approaches which rely on sequence homology, and it has the advantage that strong sequence conservation is not required for reliable prediction of edit sites. CONCLUSION: Our results suggest that ab initio methods can generate robust classifiers of putative edit sites, and we highlight the value of combinatorial approaches as embodied by genetic algorithms. We present REGAL as one approach with the potential to be generalized to other organisms exhibiting C → U RNA editing

    Differentially expressed alternatively spliced genes in Malignant Pleural Mesothelioma identified using massively parallel transcriptome sequencing

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Analyses of Expressed Sequence Tags (ESTs) databases suggest that most human genes have multiple alternative splice variants. The alternative splicing of pre-mRNA is tightly regulated during development and in different tissue types. Changes in splicing patterns have been described in disease states. Recently, we used whole-transcriptome shotgun pryrosequencing to characterize 4 malignant pleural mesothelioma (MPM) tumors, 1 lung adenocarcinoma and 1 normal lung. We hypothesized that alternative splicing profiles might be detected in the sequencing data for the expressed genes in these samples.</p> <p>Methods</p> <p>We developed a software pipeline to map the transcriptome read sequences of the 4 MPM samples and 1 normal lung sample onto known exon junction sequences in the comprehensive AceView database of expressed sequences and to count how many reads map to each junction. 13,274,187 transcriptome reads generated by the Roche/454 sequencing platform for 5 samples were compared with 151,486 exon junctions from the AceView database. The exon junction expression index (EJEI) was calculated for each exon junction in each sample to measure the differential expression of alternative splicing events. Top ten exon junctions with the largest EJEI difference between the 4 mesothelioma and the normal lung sample were then examined for differential expression using Quantitative Real Time PCR (qRT-PCR) in the 5 sequenced samples. Two of the differentially expressed exon junctions (ACTG2.aAug05 and CDK4.aAug05) were further examined with qRT-PCR in additional 18 MPM and 18 normal lung specimens.</p> <p>Results</p> <p>We found 70,953 exon junctions covered by at least one sequence read in at least one of the 5 samples. All 10 identified most differentially expressed exon junctions were validated as present by RT-PCR, and 8 were differentially expressed exactly as predicted by the sequence analysis. The differential expression of the AceView exon junctions for the ACTG2 and CDK4 genes were also observed to be statistically significant in an additional 18 MPM and 18 normal lung samples examined using qRT-PCR. The differential expression of these two junctions was shown to successfully classify these mesothelioma and normal lung specimens with high sensitivity (89% and 78%, respectively).</p> <p>Conclusion</p> <p>Whole-transcriptome shotgun sequencing, combined with a downstream bioinformatics pipeline, provides powerful tools for the identification of differentially expressed exon junctions resulting from alternative splice variants. The alternatively spliced genes discovered in the study could serve as useful diagnostic markers as well as potential therapeutic targets for MPM.</p

    The Germ Cell Nuclear Proteins hnRNP G-T and RBMY Activate a Testis-Specific Exon

    Get PDF
    The human testis has almost as high a frequency of alternative splicing events as brain. While not as extensively studied as brain, a few candidate testis-specific splicing regulator proteins have been identified, including the nuclear RNA binding proteins RBMY and hnRNP G-T, which are germ cell-specific versions of the somatically expressed hnRNP G protein and are highly conserved in mammals. The splicing activator protein Tra2β is also highly expressed in the testis and physically interacts with these hnRNP G family proteins. In this study, we identified a novel testis-specific cassette exon TLE4-T within intron 6 of the human transducing-like enhancer of split 4 (TLE4) gene which makes a more transcriptionally repressive TLE4 protein isoform. TLE4-T splicing is normally repressed in somatic cells because of a weak 5′ splice site and surrounding splicing-repressive intronic regions. TLE4-T RNA pulls down Tra2β and hnRNP G proteins which activate its inclusion. The germ cell-specific RBMY and hnRNP G-T proteins were more efficient in stimulating TLE4-T incorporation than somatically expressed hnRNP G protein. Tra2b bound moderately to TLE4-T RNA, but more strongly to upstream sites to potently activate an alternative 3′ splice site normally weakly selected in the testis. Co-expression of Tra2β with either hnRNP G-T or RBMY re-established the normal testis physiological splicing pattern of this exon. Although they can directly bind pre-mRNA sequences around the TLE4-T exon, RBMY and hnRNP G-T function as efficient germ cell-specific splicing co-activators of TLE4-T. Our study indicates a delicate balance between the activity of positive and negative splicing regulators combinatorially controls physiological splicing inclusion of exon TLE4-T and leads to modulation of signalling pathways in the testis. In addition, we identified a high-affinity binding site for hnRNP G-T protein, showing it is also a sequence-specific RNA binding protein

    Alternative splicing enriched cDNA libraries identify breast cancer-associated transcripts

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Alternative splicing (AS) is a central mechanism in the generation of genomic complexity and is a major contributor to transcriptome and proteome diversity. Alterations of the splicing process can lead to deregulation of crucial cellular processes and have been associated with a large spectrum of human diseases. Cancer-associated transcripts are potential molecular markers and may contribute to the development of more accurate diagnostic and prognostic methods and also serve as therapeutic targets. Alternative splicing-enriched cDNA libraries have been used to explore the variability generated by alternative splicing. In this study, by combining the use of trapping heteroduplexes and RNA amplification, we developed a powerful approach that enables transcriptome-wide exploration of the AS repertoire for identifying AS variants associated with breast tumor cells modulated by <it>ERBB2</it> (<it>HER-2/neu</it>) oncogene expression.</p> <p>Results</p> <p>The human breast cell line (C5.2) and a pool of 5 ERBB2 over-expressing breast tumor samples were used independently for the construction of two AS-enriched libraries. In total, 2,048 partial cDNA sequences were obtained, revealing 214 alternative splicing sequence-enriched tags (ASSETs). A subset with 79 multiple exon ASSETs was compared to public databases and reported 138 different AS events. A high success rate of RT-PCR validation (94.5%) was obtained, and 2 novel AS events were identified. The influence of <it>ERBB2</it>-mediated expression on AS regulation was evaluated by capillary electrophoresis and probe-ligation approaches in two mammary cell lines (Hb4a and C5.2) expressing different levels of <it>ERBB2</it>. The relative expression balance between AS variants from 3 genes was differentially modulated by <it>ERBB2</it> in this model system.</p> <p>Conclusions</p> <p>In this study, we presented a method for exploring AS from any RNA source in a transcriptome-wide format, which can be directly easily adapted to next generation sequencers. We identified AS transcripts that were differently modulated by <it>ERBB2</it>-mediated expression and that can be tested as molecular markers for breast cancer. Such a methodology will be useful for completely deciphering the cancer cell transcriptome diversity resulting from AS and for finding more precise molecular markers.</p

    Gene expression and splicing alterations analyzed by high throughput RNA sequencing of chronic lymphocytic leukemia specimens.

    Get PDF
    BackgroundTo determine differentially expressed and spliced RNA transcripts in chronic lymphocytic leukemia specimens a high throughput RNA-sequencing (HTS RNA-seq) analysis was performed.MethodsTen CLL specimens and five normal peripheral blood CD19+ B cells were analyzed by HTS RNA-seq. The library preparation was performed with Illumina TrueSeq RNA kit and analyzed by Illumina HiSeq 2000 sequencing system.ResultsAn average of 48.5 million reads for B cells, and 50.6 million reads for CLL specimens were obtained with 10396 and 10448 assembled transcripts for normal B cells and primary CLL specimens respectively. With the Cuffdiff analysis, 2091 differentially expressed genes (DEG) between B cells and CLL specimens based on FPKM (fragments per kilobase of transcript per million reads and false discovery rate, FDR q &lt; 0.05, fold change &gt;2) were identified. Expression of selected DEGs (n = 32) with up regulated and down regulated expression in CLL from RNA-seq data were also analyzed by qRT-PCR in a test cohort of CLL specimens. Even though there was a variation in fold expression of DEG genes between RNA-seq and qRT-PCR; more than 90 % of analyzed genes were validated by qRT-PCR analysis. Analysis of RNA-seq data for splicing alterations in CLL and B cells was performed by Multivariate Analysis of Transcript Splicing (MATS analysis). Skipped exon was the most frequent splicing alteration in CLL specimens with 128 significant events (P-value &lt;0.05, minimum inclusion level difference &gt;0.1).ConclusionThe RNA-seq analysis of CLL specimens identifies novel DEG and alternatively spliced genes that are potential prognostic markers and therapeutic targets. High level of validation by qRT-PCR for a number of DEG genes supports the accuracy of this analysis. Global comparison of transcriptomes of B cells, IGVH non-mutated CLL (U-CLL) and mutated CLL specimens (M-CLL) with multidimensional scaling analysis was able to segregate CLL and B cell transcriptomes but the M-CLL and U-CLL transcriptomes were indistinguishable. The analysis of HTS RNA-seq data to identify alternative splicing events and other genetic abnormalities specific to CLL is an added advantage of RNA-seq that is not feasible with other genome wide analysis

    Characterization of the mouse Dazap1 gene encoding an RNA-binding protein that interacts with infertility factors DAZ and DAZL

    Get PDF
    BACKGROUND: DAZAP1 (DAZ Associated Protein 1) was originally identified by a yeast two-hybrid system through its interaction with a putative male infertility factor, DAZ (Deleted in Azoospermia). In vitro, DAZAP1 interacts with both the Y chromosome-encoded DAZ and an autosome-encoded DAZ-like protein, DAZL. DAZAP1 contains two RNA-binding domains (RBDs) and a proline-rich C-terminal portion, and is expressed most abundantly in the testis. To understand the biological function of DAZAP1 and the significance of its interaction with DAZ and DAZL, we isolated and characterized the mouse Dazap1 gene, and studied its expression and the subcellular localization of its protein product. RESULTS: The human and mouse genes have similar genomic structures and map to syntenic chromosomal regions. The mouse and human DAZAP1 proteins share 98% identity and their sequences are highly similar to the Xenopus orthologue Prrp, especially in the RBDs. Dazap1 is expressed throughout testis development. Western blot detects a single 45 kD DAZAP1 protein that is most abundant in the testis. Although a majority of DAZAP1 is present in the cytoplasmic fraction, they are not associated with polyribosomes. CONCLUSIONS: DAZAP1 is evolutionarily highly conserved. Its predominant expression in testes suggests a role in spermatogenesis. Its subcellular localization indicates that it is not directly involved in mRNA translation

    Signature of multilayer growth of 2D layered Bi2Se3 through heteroatom-assisted step-edge barrier reduction

    Get PDF
    During growth of two-dimensional (2D) materials, abrupt growth of multilayers is practically unavoidable even in the case of well-controlled growth. In epitaxial growth of a quintuple-layered Bi2Se3 film, we observe that the multilayer growth pattern deduced from in situ x-ray diffraction implies nontrivial interlayer diffusion process. Here we find that an intriguing diffusion process occurs at step edges where a slowly downward-diffusing Se adatom having a high step-edge barrier interacts with a Bi adatom pre-existing at step edges. The Se???Bi interaction lowers the high step-edge barrier of Se adatoms. This drastic reduction of the overall step-edge barrier and hence increased interlayer diffusion modifies the overall growth significantly. Thus, a step-edge barrier reduction mechanism assisted by hetero adatom???adatom interaction could be fairly general in multilayer growth of 2D heteroatomic materials

    Hypoxia inducible factor 1α gene (HIF-1α) splice variants: potential prognostic biomarkers in breast cancer

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Hypoxia-inducible factor 1 (HIF-1) is a master transcriptional regulator of genes regulating oxygen homeostasis. The HIF-1 protein is composed of two HIF-1α and HIF-1β/aryl hydrocarbon receptor nuclear translocator (ARNT) subunits. The prognostic relevance of HIF-1α protein overexpression has been shown in breast cancer. The impact of HIF-1α alternative splice variant expression on breast cancer prognosis in terms of metastasis risk is not well known.</p> <p>Methods</p> <p>Using real-time quantitative reverse transcription PCR assays, we measured mRNA concentrations of total <it>HIF-1α </it>and 4 variants in breast tissue specimens in a series of 29 normal tissues or benign lesions (normal/benign) and 53 primary carcinomas. In breast cancers <it>HIF-1α </it>splice variant levels were compared to clinicopathological parameters including tumour microvessel density and metastasis-free survival.</p> <p>Results</p> <p><it>HIF-1α </it>isoforms containing a three base pairs TAG insertion between exon 1 and exon 2 (designated <it>HIF-1α</it><sup><it>TAG</it></sup>) and <it>HIF-1α</it><sup><it>736 </it></sup>mRNAs were found expressed at higher levels in oestrogen receptor (OR)-negative carcinomas compared to normal/benign tissues (<it>P </it>= 0.009 and <it>P </it>= 0.004 respectively). In breast carcinoma specimens, lymph node status was significantly associated with <it>HIF-1α</it><sup><it>TAG </it></sup>mRNA levels (<it>P </it>= 0.037). Significant statistical association was found between tumour grade and <it>HIF-1α</it><sup><it>TAG </it></sup>(<it>P </it>= 0.048), and total <it>HIF-1α </it>(<it>P </it>= 0.048) mRNA levels. <it>HIF-1α</it><sup><it>TAG </it></sup>mRNA levels were also inversely correlated with both oestrogen and progesterone receptor status (<it>P </it>= 0.005 and <it>P </it>= 0.033 respectively). Univariate analysis showed that high <it>HIF-1α</it><sup><it>TAG </it></sup>mRNA levels correlated with shortened metastasis free survival (<it>P </it>= 0.01).</p> <p>Conclusions</p> <p>Our results show for the first time that mRNA expression of a <it>HIF-1α</it><sup><it>TAG </it></sup>splice variant reflects a stage of breast cancer progression and is associated with a worse prognosis.</p> <p>See commentary: <url>http://www.biomedcentral.com/1741-7015/8/45</url></p

    Correlating changes in lung function with patient outcomes in chronic obstructive pulmonary disease: a pooled analysis

    Get PDF
    Background Relationships between improvements in lung function and other clinical outcomes in chronic obstructive pulmonary disease (COPD) are not documented extensively. We examined whether changes in trough forced expiratory volume in 1 second (FEV1) are correlated with changes in patient-reported outcomes. Methods Pooled data from three indacaterol studies (n = 3313) were analysed. Means and responder rates for outcomes including change from baseline in Transition Dyspnoea Index (TDI), St. George's Respiratory Questionnaire (SGRQ) scores (at 12, 26 and 52 weeks), and COPD exacerbation frequency (rate/year) were tabulated across categories of ΔFEV1. Also, generalised linear modelling was performed adjusting for covariates such as baseline severity and inhaled corticosteroid use. Results With increasing positive ΔFEV1, TDI and ΔSGRQ improved at all timepoints, exacerbation rate over the study duration declined (P < 0.001). Individual-level correlations were 0.03-0.18, but cohort-level correlations were 0.79-0.95. At 26 weeks, a 100 ml increase in FEV1 was associated with improved TDI (0.46 units), ΔSGRQ (1.3-1.9 points) and exacerbation rate (12% decrease). Overall, adjustments for baseline covariates had little impact on the relationship between ΔFEV1 and outcomes. Conclusions These results suggest that larger improvements in FEV1 are likely to be associated with larger patient-reported benefits across a range of clinical outcomes
    corecore